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NOVEL CHAIN TERMINATORS, THE USE THEREOF FOR NUCLEIC ACID 
SEQUENCING AND SYNTHESIS AND A METHOD OF THEIR PREPARATION 

The present invention relates to novel nucleic acid 
5 chain extension terminators, their use in nucleic acid 
sequencing and synthesis, respectively, as well as a 
method for preparing such compounds. 

Today, there are two predominant methods for DNA 
sequence determination: the chemical degradation method 
10 (Maxam and Gilbert, Proc. Natl. Acad. Sci. 24:560-564 

(1977), and the dideoxy chain termination method (Sanger 
et al., Proc. Natl. Acad. Sci. 21:5463-5467 (1977)). Most 
automated sequencers are based on the chain termination 
method utilizing fluorescent detection of product 
formation. In these systems either primers to which 
deoxynucleotides and dideoxynucleotides are added are dye- 
labelled, or the added dideoxynucleotides are 
fluorescently labelled. As an alternative, dye labelled 
deoxynucleotides can be used in conjunction with unlabeled 
dideoxynucleotides. This chain termination method is based 
upon the ability of an enzyme to add specific, nucleotides 
onto the 3 ' hydroxyl end of a primer annealed to a 
template. The base pairing property of nucleic acids 
determines the specificity of nucleotide addition. The 
extension products are then separated electrophoretically 
on a polyacrylamide gel and detected by an optical system 
utilizing laser excitation. 

Although both the chemical degradation method and the 
dideoxy chain termination method are in widespread use, 
there are many associated disadvantages. For example, the 
methods require gel-electrophoretic separation. Typically, 
only 400-800 base pairs can be sequenced from a single 
clone. As a result, the systems are both time- and labor- 
intensive. Methods avoiding gel separation have been 
developed in attempts to increase the sequencing 
throughput . 

Sequencing by hybridization (SBH) methods have been 
proposed by Crkvenjakov (Drmanac et al., Genomics 1:114 
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(1989); Strezoska et al.. (Proc. Natl. Acad. Sci. USA 
fifi:10089 (1991)), Bains and Smith (Bains and Smith, J. 
Theoretical Biol. 115.: 303 (1988)) and in OS-A-5 . 202 . 231 . 
This type of system utilizes the information obtained from 
5 multiple hybridizations of the polynucleotide of interest, 
using short oligonucleotides to determine the nucleic acid 
sequence. These methods potentially can increase the 
sequence throughput beacuse multiple hybridization 
reactions are performed simultaneously. To reconstruct the 
10 sequence, however, an extensive computer search algorithm 
is required to determine the most likely order of all 
fragments obtained from the multiple hybridizations. 

The SBH methods are problematic in several respects. 
For example, the hybridization is dependent upon the 
15 sequence composition of the duplex of the oligonucleotide 
and the polynucleotide of interest, so that GC-rich 
regions are more stable than AT-rich regions. As a result, 
false positives and false negatives during hybridization 
detection are frequently present and complicate sequence 
20 determination. Furthermore, the sequence of the 

polynucleotide is not determined directly, but is inferred 
from the sequence of the known probe, which increases the 
possibility for error. 

Methods have also been proposed which detect the 
25 addition or removal of single molecules from a DNA strand. 
For example, Hyman E.D. , Anal. Biochem. , 114:423 
(1988) discloses the addition of a nucleotide to a an 
immobilised DNA template /primer complex in the presence of 
a polymerase and determination of polymerisation reaction 
30 by detecting the pyrophosphate liberated as a result of 
the polymerisation. 

Jett et al., J . Biomol. Struct. Dyn., I, p. 301, 1989 
discloses a method wherein a single stranded DNA or RNA 
molecule of labelled nucleotides, complementary to the 
35 sequence to be determined, is suspended in a moving flow 
stream. Individual bases are then cleaved sequentially 
from the end of the suspended sequence and determined by a 
detector passed by the flow stream. 
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EP-A-223 618 discloses the use of an immobilised DNA 
template, primer and polymerase exposed to a flow 
containing only one species of deoxynucleotide at a time. 
A downstream detection system then determines whether 
5 deoxynucleotide is incorporated into the copy or not by 
detecting the difference in deoxynucleotide concentrations 
entering and leaving the flow cell containing the complex 
of DNA template and polymerase. 

WO 90/13666 proposes a method directly measuring the 
10 growth of the template copy rather than determining it 

indirectly from compositions in the flow medium. Only one 
of the four nucleotides is present at a time, and the 
polymerisation events reflecting the incorporation of a 
nucleotide or not are detected by spectroscopic means 
15 (evanescent wave spectroscopy, fluorescence detection, 

absorption spectroscopy) or by the individual nucleotides 
being labelled. 

Similar methods employing labelled 3 ' -blocked 
deoxynucleotides where the blocking group is removable and 
20 which thus permit sequential deoxynucleotide 

addition/detection steps are disclosed in WO 91/06678, US- 
A-5,302,509, DE-A-414 1178 and WO 93/21340. However, the 
necessary 3' -blocking groups are either not described in 
any detail, or are not accepted by the required enzyme, or 
25 do not permit desired rapid deblocking of the growing 
template copy strand after each polymerisation event. 

One object of the present invention is to provide 
novel nucleotide derivatives which may be used as chain 
terminators and which by deprotection may readily be 
30 converted into nucleotides or nucleotide analogues that 
may be further extended. 

Another object of the present invention is to provide 
a method for nucleotide sequence determination using the 
novel chain terminators. 
35 Still another object of the present invention \is to 

provide a method of synthesizing oligo- or polynucleotides 
by means of the novel chain terminators. 



WO &6/23807 



4 



PCT/SE96/00096 



Another object of the present invention is to provide 
a process of preparing novel chain terminators according 
to the invention. 

In accordance with the invention, these objects are 
5 achieved by the provision of a chain terminating 

nucleotide or nucleotide analogue having its 3 ' -hydroxy 1 
group protected by an acetal or thioacetal structure 
designed in such a way that the 3'-hydroxyl can be 
deprotected in a relatively short time in dilute acid, 
10 such as hydrochloric acid at pH 2, for example. 

in one aspect, the present invention therefore 
provides a compound of the general formula I: 
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25 or a salt thereof, such as a trimethylammonium, ammonium, 
sodium or potassium salt, wherein 
B is a nucleobase, 

X and Z independently are oxygen or sulphur, 

Y is hydrogen, hydroxy or protected hydroxy, such as 
30 methoxy, ethoxy or allyloxy, 

Rl is hydrocarbyl, which optionally is substituted 
with a functional group, 

R 2 is hydrogen or hydrocarbyl, which optionally is 
substituted with a functional group, 
35 A is an electron withdrawing or electron donating 

group capable of moderating the acetal stability of the 
compound I via l>\. 
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L X and L2 are hydrocarbon linkers, which may be the 
same or different, L»2, when present, being either (i) 
connected to Li via the group A, or (ii) directly 
connected to Li, the group A then being bound to one of 
5 linkers and L2, 

F is a dye label, 

Q is a coupling group for F, and 

1, m and n independently are 0 or 1, with the proviso 
that 1 is 1 when mis 1, and 1 is 1 and m is 1 when n is 
10 1. 

In Formula I above, the nucleobase B may be natural 
or synthetic. Natural nucleobases include common 
nucleobases, such as adenine, guanine, cytosine, thymine 
and uracil, as well as less common nucleobases, such as 

15 xanthine, hypoxanthine or 2-aminopurine. Synthetic 

nucleobases B are analogues to the natural nucleobases and 
capable of interacting with other nucleobases in a 
specific, hydrogen bond determined way. 

The hydrocarbyl groups represented by Ri and R2 

20 include a wide variety, including straight and branched 
chain alkyl, alkenyl, aryl, aralkyl and cycloalkyl, 
preferably containing up to 10 carbon atoms. Preferred 
hydrocarbyl groups are primary, secondary or tertiary 
alkyl, alkenyl or alkynyl groups, especially lower alkyl 

25 groups, such as methyl and ethyl. The optional functional 
group substituents on Ri and R 2 are capable of moderating 
the lability of the 3' acetal group through an inductive 
effect. Exemplary of such functional groups are tert. 
amino, nitro, cyano and halogen (fluoride, chloride, 

30 bromide, iodide) . 

Detectable moiety, or label F can be chosen from a 
vast number of such moieties known to those skilled in the 
art. Exemplary such moities are radioactively labeled 
functions, luminescent, electroluminescent or fluorescent 

35 labels, and labels that absorb characteristic visible or 
infrared light. Preferably, F is a fluorescent label. 

Coupling group Q, when n=0, is a reactive group to 
which a label F can be coupled, or, when n=l, is the 
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derivatized residue of a reactive group used for coupling 
linker L 2 to label F. A large number of such coupling 
groups capable of reacting with and binding label F via a 
reactive function thereon are known to those skilled in 
5 the art. Examples of reactive groups are amino, thio, and 
carboxyl. In some cases, the group Q is derived completely 
from the reactive function on the uncoupled F. For 
example, if the coupling reaction is a substitution 
reaction, the group reacting with F being, for example, a 
10 halogen (fluoride, chloride, bromide or iodide), then the 
group Q will be represented by the reactive function 
present on label F. 

For certain applications of the compounds of Formula 
I, as will be described below, no detectable moiety will 
15 be needed, and the label F, and optionally also the group 
Q, can then be omitted. 

Electron withdrawing or donating group A is 
incorporated into the structure as a moderator of acetal 
stability. The group A may represent a part of the chain 
20 L1-A-L2. Representative groups A in that case are amido, 

sulfoxy, sulfone, carbalkoxy (ester), ether, thioether and 
amino groups. Alternatively, A may be a side substituent 
for the Li-L 2 chain, representative groups then being, for 
example, cyano, nitro and halogen (halogen including 
25 fluoride, chloride, bromide and iodide) . In this latter 

case, linker Li will donate structure stretching from the 
acetal carbon to the place of the substitution, and linker 
L 2 will donate structure stretching from this substitution 
to the group A. It is to be emphasized that the specific 
30 electron withdrawing and electron donating groups 

mentioned above are only examples and that many more such 
groups are known and obvious to those skilled in the art. 

The structure of hydrocarbon linker Li will be 
selected with regard to the function A, inductive effects 
35 being highly depending on distance. While a straight 

aliphatic (saturated or unsaturated) hydrocarbon chain is 
preferred, branched or cyclic hydrocarbons may be 
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contemplated. Preferably, linker Li has up to 10 carbon 
atoms, more preferably up to 6 carbon atoms. 

The function of hydrocarbon linker L 2 is to provide, 
together with linker L^, A and coupling group Q, for a 
5 sufficiently long distance between the label moiety F and 
the rest of the compound of Formula I. This is required by 
spatial preferences of the enzymes (polymerases) for which 
the compound of Formula I acts as substrate or by the 
necessity of avoiding interaction between the label and 
10 nucleobase B. It is readily understood that the structure 
of linker L 2 highly depends on the particular enzyme, 
label F and possibly also nucleobase B that are used. A 
suitable length and structure of linker h 2 will therefore 
be selected by the skilled person for each particular 
15 situation. Similarly as for linker Li, a preferred 

structure for linker L 2 is a straight aliphatic (saturated 
or unsaturated) hydrocarbon chain, although branched or 
cyclic hydrocarbons may be contemplated. Preferably, 
linker L 2 has up to 10 carbon atoms, more preferably up to 
20 6 carbon atoms. 

In cases where the label F and the coupling group Q 
may be omitted, linker L2 may, of course, also be omitted. 

The compounds of Formula I can be deprotected to 
exhibit a free 3 '-hydroxy group in a relatively short time 
25 under acidic conditions, e.g. with hydrochloric acid at 
about pH 2. It is understood that by proper selection of 
the groups R 1# R 2 , A, Li and L 2 with regard to each other, 
the deprotection time may be adjusted to a desired range. 
Under the mentioned acidic conditions, the most preferred 
30 compounds of Formula I will be deprotected within, say, 
0.01 to 15 minutes. 

While the compounds of Formula I may be used as pure 
chain extension inhibitors, or chain terminators, for 
example, in DNA sequencing according to the chain 
35 termination method, as is per se known in the art, the 
advantages of the compounds are, of course, better 
benefited from when the convenient deprotection 
capabilities of the compounds are utilized. This is, for 
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example, the case when the compounds I are used in nucleic 
acid sequencing methods based on the sequential 
incorporation and determination of individual nucleotides 
in a growing nucleic acid copy strand as described in, for 
5 example, the aforementioned WO 91/06678, US-A-5, 302, 509, 
DE-A-414 1178 and WO 93/21340. 

Another aspect of the invention therefore provides a 
method for determining the sequence of a nucleic acid, 
which method comprises providing a single-stranded 

10 template comprising the nucleic acid to be determined, and 
at least partially synthesizing a complementary nucleic 
acid molecule in a stepwise serial manner by the addition 
of nucleotides in which the identity of each nucleotide 
incorporated into the complementary nucleic acid molecule 

15 is determined subsequent to its incorporation, wherein 
said nucleotides are compounds of Formula I as defined 
above, and wherein the 3' -blocking group is removed from 
the nucleotide after its incorporation to permit further 
extension of the nucleic acid molecule. 

20 In one embodiment, a method for determining the 

sequence of a nucleic acid comprises the following steps: 

(i) providing a single-stranded template comprising 
the nucleic acid to be sequenced, 

(ii) hybridising a primer to the template to form a 
25 template/primer complex, 

(iii) subjecting the primer to an extension reaction by 
the addition of compounds of Formula I with different 
nucleobases B corresponding the four bases A, C, T and G 
or analogues thereof, 

30 (iv) determining the type of the compound of Formula I 

added to the primer, 

(v) selectively hydrolysing the acetal protective 
group , and 

(vi) repeating steps (iii) to (v) sequentially and 
35 recording the order of incorporation of compounds of 

Formula I. 

The different compounds of Formula I in step (iii) 
may be added in sequence, in which case the four different 
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compounds I may carry the same label F. Alternatively, the 
different compounds I have different labels F and are 
added at the same time. 

In a preferred embodiment of the method of the 
5 invention, the template/primer complex is bound to a 
solid-phase support, such as a sequencing chip, for 
example. The template may be attached to the solid support 
via a binding linker, which, for instance, is ligated to 
the 5 ' -end of the template or incorporated in one of the 
10 ends of the template by polymerase chain reaction (PCR) . 
The binding linker may then be attached to the solid 
support by use of a streptavidin coupling system. 
Alternatively, the primer may be attached to the solid 
support . 

15 The compounds of Formula I may, of course, also 

conveniently be used in so-called mini -sequencing (see 
e.g. Syvanen A-C et al.. Genomics &:684-692 (1990). 

As is readily understood by a person skilled in the 
art, the compounds of Formula I may also be used in the 

20 synthesis of nucleotide chains, and another aspect of the 
invention relates to such use. For example, 
oligonucleotides and polynucleotides may be prepared by 
successively coupling compounds of Formula I to each other 
in any desired base order with intervening deblocking and 

25 using a non-template dependent polymerase, such as a 

terminal transferase. For such synthesis, the groups L2, Q 
and F in Formula I may, of course, be omitted. 

The compounds of Formula I may be prepared by methods 
known per se. In a further aspect, however, the invention 

30 provides a particular method for the preparation of a 
subgroup of compounds of Formula I by direct 3 ' -OH 
protection, and more particularly compounds of Formula la: 
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or a salt thereof, wherein B, X, Z, Ri« R2, Q< ^ < m and n 
are as defined above, and p and q independently are 
integers from 1 to 10, preferably from 1 to 6, by reacting 
a compound of Formula II : 

15 
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or a salt thereof, wherein B and X are as defined above, 
25 with a compound of Formula III: 

R 2 -C-CH-(CH 2 )p-l-COOR3 III 



30 

wherein Rj., R2. Z and p are as defined above, and R 3 is 
hydrocarbyl (for example, as defined for Ri and R 2 above) , 
to produce a compound of Formula IV: 
O O x 

35 HO-P-O-P-O-P-O-i^O B 

OH OH OH \ / IV 
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or a salt thereof, wherein B, X, Z, R 1( R 2 , R 3 and p are 
as defined above, optionally reacting the latter with a 
diamine H 2 N-.(CH 2 ) q -NH 2 , wherein q is as defined above, to 
produce a compound of Formula V: 



wherein B, X, Z, R 1# R 2 , p and q are as defined above, and 
optionally coupling a dye label F to the terminal amino 



In the following, the invention will be illustrated 
by some non-limiting examples. Reference will be made to 
the accompanying drawings, wherein: 

Fig. 1 is a reaction scheme for the enolether 
synthesis described in Example 1 below. Intermediate and 
final products corresponding to various values of integer 
"n" in the respective structural formulae are identified 
by numbers indicated within brackets after the n-values 
listed under the respective formulae. 

Fig. 2 is a reaction scheme for the preparation of 
3'-acetal-modified thymidine described in Example 2 below. 
Prepared final compounds corresponding to various values 
of integer "n" in the respective structural formula are 
identified by numbers indicated within brackets after the 
n-values listed under the formula. 

Fig. 3 is a graph showing thymidine 3 ' -acetal 
hydrolysis at pH 4 as log % of remaining acetal versus 
time in minutes. 

Fig. 4 is a reaction scheme for the one-pot synthesis 
of 3 • -acetal protected deoxynucleotide triphosphates 
described in Example 4 below. Intermediate and final 
products corresponding to various values of integer "n" in 




0 ^ N -^CH 2 ) q -NH 2 
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the respective structural formulae are identified by 
numbers indicated within brackets after the n-values 
listed under the respective formulae. 

EXAMPLE 1 

5 synthesis of enolethers for derivatization of nucleotide 

triphosphates (Fig. 1) 

ffrftr7 i Rsterifi ration of Kftt.pacids 

The appropriate ketoacid (1 eq) was added at once to 
a large excess of dry methanol (20 eq) , previously treated 

10 with thionyl chloride (0.1 eq) . The homogenous mixture was 
refluxed overnight, evaporated under reduced pressure, and 
partitioned between saturated sodium hydrogen carbonate 
and dichloromethane. The combined organic extracts were 
dried with magnesium sulphate, evaporated and the residue 

15 was distilled under reduced pressure to give the 

appropriate methyl ester in a high yield (80% to 95%) . 
The following ketoacids were used: 

1) 4-oxy pentanoic acid (levulinic acid) (Aldrich) 

2) 5-oxy hexanoic acid has not been isolated. Instead a 
20 commercial ethyl ester-derivative of this acid (Merck) was 

transesterified in a reaction analogous to the above 
general procedure. 

3) 6-oxy heptanoic acid was obtained from 2-methyl 
cyclohexanol (Merck) , according to a published procedure 

25 (Org. Synth. 21: 3-5, (1951)). 

4) 7-oxy octanoic acid was obtained from 2-acetyl 
cyclohexanone as described (J. Am. Chem. Soc. 1^:4023-4026 
(1948)) . 

s rm} ? amthsa ia " f the dimsihaaaacafcaJ derivatives 

30 Irnmnnun^ 1-d in Fio. 1) 

Benzenesulfonic acid (0.01 eq) was added to each of 
above methyl esters (1 eq) dissolved in methanol (2 eq) 
and trimethylorthoformate (3 eq) . The dark brown mixture 
was refluxed for 3 h, neutralized by addition of dry 

35 triethylamine (0.1 eq) . and evaporated. The residue was 
distilled under lowered pressure to yield a pure acetal. 
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Compound 1. The physical and NMR data for this acetal 
correspond well to the data reported previously, 

Con(pound 2. Yield 85%, bp. 120° (15 mm Hg) *H NMR (CDCI3) : 
5 1.28 (s, 3 H), 1.60-1.69 (m, 4 H) , 2.34 (t, 2 H) , 3.17 <s, 
6 H) , 3.67 (s, 3 H) . 

Compound 3. Yield 92%, bp. 130-133° (15 mm Hg) *H NMR 
(CDCI3): 1.25 (s, 3 H), 1.29-1.35 <m, 2 H) , 1.60-1.70 (m, 
10 4 H), 2.33 (t, 2 H), 3.17 (s, 6 H) , 3.67 (s, 3 H) . 

Compound 4. Yield 87%, bp. 147-149° (15 mm Hg) *H NMR 
(CDCI3); 1.25 (s, 3 H), 1.27-1.38 (m, 4 H) , 1.57-1,67 (m, 
4 H), 2.31 (t, 2 H), 3.16 (s, 6 H) , 3.66 (s, 3 H) . 

15 

Step 3. Synthesis of enolethers (compou nds 5-a in Fig, n . 

A mixture of the appropriate acetal (1 eq) and 
trimethylortoformate (0.5 eq) was placed in a distillation 
flask, equipped with a 20 cm Vigreux column. 

20 Benzenesulfonic acid (0.02 eq) was added and the mixture 
was refluxed. The heating was regulated so that the 
liberated methanol evaporated slowly. After 4 h the 
heating was increased and the dark mixture was 
fractionated under reduced pressure without previous 

25 neutralization of the acidic catalyst. The yields of 

isolated colorless enolethers were in all cases very high 
(85-95%) but GC and NMR analyses showed the presence of 
starting acetal in proportions from 10 to 30 %. Since 
these impurities were not expected to influence the 

30 derivatization of nucleotides no attempts were made to 
purify them further. The complete assignment of NMR 
signals was difficult because of the presence of starting 
material and the fact that these asymmetric enolethers 
exist as several isomeric forms. Nevertheless, in all 

35 spectra a vinylic signal of CH2 from one isomer at around 
4.4 to 4.5 ppm and a vinylic signal of CH from the other 
isomer at 3.85 ppm could be easily observed. 
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EXAMPLE 2 

Synthesis of 3' acetal -modified thymidine (Fig. 2) 

The 5 '-protected thymidine, 5'-FM0C T (FMOC-f luorenyl 
methoxycarbonyl) , (116 mg, 0.25 iranol) was dried by 

5 coevaporation with dry acetonitrile (10 ml) and dissolved 
in dry dioxane (5 ml). To this magnetically stirred 
solution an appropriate enolether (0.5 ml, * 10 eq) 
prepared in Example 1 was added, followed by 
trifluoroacetic acid (20 ml, 1 eq) . After 45 min, the TLC 

10 analysis (Silicagel 60 F254, 10% methanol in chloroform) 
showed complete consumption of the starting material, and 
dry triethylamine (2 ml) was introduced in order to remove 
the base-labile FMOC-group. After this process was 
completed (60 min), the whole mixture was evaporated under 

15 reduced pressure. The nucleoside was separated from the 

excess of enolether by precipitation from petroleum ether, 
and the precipitate, dissolved in 5 ml of dry methanol, 
was treated with 1, 3-diaminopropane (2 ml, 100 eq) at 60° 
for 180 min in order to effect the aminolysis of the ester 

20 function on the acetal moiety. Finally, the reaction 

mixture was evaporated under low pressure (oil pump) and 
the crude material was flash chromatographed on a silica 
gel column, equilibrated in ethanol and using a step 
gradient of cone, ammonium hydroxide (0-10%) in ethanol. 

25 The material, homogeneous by TLC (developed in 1-butanol - 
ammonium hydroxide 8:2), was combined, evaporated, and 
coevaporated with toluene, to give the NMR pure product 
(compounds 9-12 in Fig. 2) in high yield (72 to 85%). The 
pure material was stored as a stock solution in methanol 

30 after addition of three drops of ammonia. 

EXEMPLE 3 

Acidic hydrolysis (pH 4) of thymidine, substituted at the 
3* -position with different acetyl groups 

A reference acetate buffer pH 4.0, prepared by mixing 
35 of solutions of sodium acetate (0.20 M, 18.0 ml) with 

acetic acid (0.20 M, 82.0 ml) was used in all hydrolysis 
studies. To this buffer (10.0 ml) an ethanolic solution 
of the appropriate thymidine 3 • -acetal derivative prepared 
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in Example 2 (compounds 9-12 in Fig. 2) (100 ml) was added 
with gentle stirring. At different time points a sample of 
0.5 ml was withdrawn and placed in a tube containing 15 ml 
of cone, ammonia to raise the pH to 9. Samples were 
5 analyzed by FPLC using an anion exchange column (Mono Q - 
Pharmacia Biotech AB) and a gradient system of 
tetraethylammonium bicarbonate pH 8.2 for elution (0.05 to 
0.75 M) . The peak areas of starting material and its 
hydrolysis product thymidine were integrated, and plotted 
10 as shown in Fig. 3. 

EXAMPLE 4 

One pot-synthesis of deoaynucleotide triphosphates, 
protected at their 3 ' -OH position by a functionalized 
acetal group (Fig. 4) 

15 A commercial deoxynucleotide triphosphate (pppdT, 

PPPdC, pppdG, or pppdA) was chroma tographed on a 
preparative Mono Q column using a gradient system of 
tetraethylammonium bicarbonate pH 8.2 (0.05 to 1.3 M) to 
obtain the pure triphosphate in the form of a 
20 triethylammonium salt. This material was evaporated on a 
Rotavapor, coevaporated with dry acetonitrile (3x2 ml), 
and dissolved in molecular sieve-dried trimethylphosphate 
(0.5 ml). The appropriate enolether (compounds 5-8 in Fig. 
1) (0.2 ml) and trif luoroacetic acid (10 eq as a 10% 
25 solution in dry dioxane) were added. The homogeneous 

mixture was incubated at 20°C for 60 min. neutralized by 
the addition of triethylamine (100 ml), and precipitated 
from a 1:1 mixture of petroleum ether and diethyl ether. 
The oily precipitate was dissolved in methanol (2 ml) and 
1,3-diaminopropane (0.5 ml) or another diamine (1,4- 
diaminobutane, 1, 6-diaminohexane) (0.5 ml) was added. The 
ester function was subjected to aminolysis overnight at 
60°C. The mixture was again precipitated from 1:1 
petroleum ether and diethyl ether, washed with diethyl 
ether, and dissolved in water. The water solution was 
analyzed and preparatively purified on the described anion 
exchange column. The well resolved products (compounds 
13-28 in Fig. 4) always appear prior to the original 
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deoxynucleotide triphosphate, having a retention time 
comparable to that of the appropriate deoxynucleotide 
diphosphate (as was found from separate coinjection 
experiments). An aliquot of the isolated 3'-acetal- 
5 modified derivative was evaporated, treated with 80% 
acetic acid for 2 min, and, after evaporation of acid, 
reinjected to the same chromatographic system. In all 
cases all starting material was reacted and a single 
product with higher retention time was formed that 

10 corresponded to the original deoxynucleotide triphosphate. 
Thymidine triphosphate reacts to form, besides the desired 
3' -modified derivative, also another product with even 
shorter retention time but that also hydrolyses to the 
starting triphosphate during acid treatment. It is assumed 

15 that this product is the bis-3'-0, 4-O-acetal-derivative 
of thymidine triphosphate. This type of side products was 
not present in reactions that used other nucleotide 
triphosphates. It should also be stressed that very little 
products of depurination were formed in reactions in which 

20 pppdG and pppdA were used. This can be explained by the 
mild acid applied as a catalyst, the large excess of 
enolether used in the reaction, and the fact that the 
bases existed in an unprotected (more acid resistant) 
form. 

25 EXAMPLE 5 

Coupling of a fluorophoro to the 3 • -functionaliaed 
deoxynucleotide triphosphate (Fig. 4) 

This derivatization can be performed with a variety 
of reactive fluorophores that may differ in their 

30 reactivity and in their optimal reaction conditions. Here 
is described the procedure for labelling of 3'- 
functionalized deoxynucleotide triphosphate with 
fluorescein isothiocyanate . 

The appropriate nucleotide triphosphate, possessing 

35 an amino group attached to the 3 1 - position of the sugar 
residue via an acetal function (compounds 13-28 in Fig. 
4), was evaporated and dissolved in 0.1 M carbonate buffer 
pH 10 (0.5 ml). Fluorescein isothiocyanate (single isomer) 
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(10 eq), dissolved in dimethyl formamide (0.25 ml), was 
added and the mixture was incubated overnight at 20°C. The 
reaction mixture was applied on a prototype FPLC Superdex® 
gel filtration column (Pharmacia Biotech AB) (equivalent 
5 to Sephadex® G-10; Pharmacia Biotech AB) , equilibrated and 
run in TEA HC0 3 buffer (0.1 M) . The fluorescent material 
appearing in the void was collected (compounds 29-44 in 
Fig. 4). As expected, the original deoxynucleotide 
triphosphate was formed upon action of acid on the 
10 fluorescein- labelled nucleotide. 

EXAMPLE 6 

Stepwise enzymatical incorporation of 3* -modified 

triphosphates 

Four oligonucleotides were prepared, having sequences 
15 corresponding to the M13 sequencing primer, and designed 
to incorporate to their 3 ' -ends T, G, A or C base 
respectively : 

(T) 5 ' CGACGTTGTAAAACGACGGCCAG (23 mer) 

(G) 5 ' CGACGTTGTAAAACGACGGCCA (22 mer) 

20 (A) 5 ' CGACGTTGTAAAACGACGGCC (21 mer) 

(C) 5 ' CGACGTTGTAAAACGACGGC (20 mer) 

Each of them was radioactively labelled with 3 2p 
using T4 polynucleotidekinase . 

In order to investigate if the modified nucleotide 
25 triphosphates are accepted effectively and specifically by 
the T7 polymerase, each of the labelled primers was 
subjected to the single step extension conditions. These 
reaction mixtures (20 ml) were composed of the respective 
primer (0.1 pmol) , M13 template (1 pmol), and 3 1 -modified 
30 thymidine triphosphate (compound 16 in Fig. 4) in a buffer 
system pH 7.5 containing Tris/HCl (40 mM) , MnCl 2 (4 mM) , 
dithiothreitol (DTT) (11 mM) , sodium isocitrate (29 mM) 
and sodium chloride (23 mM) . The mixtures were denatured 
at 70°C, cooled, and incubated at 42°C after the addition 
35 of T7 DNA polymerase (10 U) for 30 min. All reactions were 
stopped by addition of formamide (20 ml), denatured at 
70 °C, cooled on ice, and separated on a 6% denaturing 
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sequencing gel. The developed autoradiogram showed that 
one-base extension had taken place solely in the reaction 
containing 23-mer-T (designed to prime the M13 template 
and incorporate thymidylic acid at its 3 •-end). None of 
5 the other reactions containing primers designed for 

incorporation of dG, dA and dC, respectively, showed any 
sign of primer extension. 

EXAMPLE 7 

Sequencing reaction with modified chain terminator 

10 All sequencing reactions were performed according to 

the standard ABI protocol (Taq Dye Primer Cycle Sequencing 
Kit - protocol 901482) . 

The sequencing reactions consisted of the following 
components: M13 mp 18 ss-DNA template (1 pmol) , 5'- 

15 fluorescein labelled universal primer (1 pmol, Pharmacia 
Biotech AB) , Ampli Taq DNA polymerase (4U) and T- 
termination mixture in cycle sequencing buffer (Tris-HCl 
pH 8.9, 80 mM, ammonium sulfate 20 mM, magnesium chloride 
5 mM) . Instead of the standard ddT triphosphate, compound 

20 16 in Fig. 4, in different proportions with TTP, was used 
as a chain terminator. After the cycling was completed, an 
equal volume of formamide was added and the mixture was 
heated for 2 min at 90°C, cooled on ice and loaded onto 6% 
acrylamide urea gel. Separation and analysis of products 

25 were performed on an A.L.F. DNA-sequencer (Pharmacia 

Biotech AB) . 

As a result, a pattern of fluorescent- labelled 
sequences was obtained. This electrophoretic picture was 
specific for the applied terminator. 
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0 0 X 

" ii M _ 
HO-P-O-P-O-P-0— , .0 B 

OH OH OH \ J 



R t -Z-C-L r (A) - (L 2 ) i- (Q) m - (P) : 
R 2 



or a salt thereof, wherein 
B is a nucleobase, 

X and Z independently are oxygen or sulphur, 
Y is hydrogen or hydroxy, which optionally may be 
protected, 

Rl is hydrocarbyl, which optionally is substituted 
with a functional group, 

R2 is hydrogen or hydrocarbyl, which optionally is 
substituted with a functional group, 

A is an electron withdrawing or electron donating 
group capable of moderating the acetal stability of the 
compound I, 

Li and L 2 are hydrocarbon linkers, which may be the 
same or different, L 2 , when present, being either (i) 
connected to Li via the group A, or (ii) directly 
connected to L 1# the group A then being connected to one 
of linkers Li and L 2 , 

F is a dye label, 

Q is a coupling group for P, and 

1, m and m independently are 0 or 1, with the proviso 
that 1 is 1 when m is 1, and 1 is 1 and m is 1 when n is 
1. 
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2. The compound according to claim 1. wherein R X and R 2 
are independently selected from primary, secondary and 
tertiary alkyl, optionally substituted with one or more 
functional groups. 

3. The compound according to claim 2. wherein Rx and R 2 
are selected from methyl and ethyl. 

4. The compound according to claim 1, 2 or 3, wherein A 
is a part of a chain Lx~A-L 2 and is selected from amido, 
sulfoxy, sulfone. carbalkoxy. ether, thioether and amino. 



5. The compound according to claim 1. 2 or 3, wherein A 
is a substituent to a chain L x -L 2 and is selected from 

15 cyano, nitro and halogen. 

6. The compound according to any one of claims 1 to 5. 
wherein Lx is a straight aliphatic hydrocarbon chain, 
preferably having up to 10 carbon atoms. 

20 

7. The compound according to any one of claims 1 to 6, 
wherein L 2 is a straight aliphatic hydrocarbon chain, 
preferably having up to 10 carbon atoms. 

25 8. The compound according to any one of claims 1 to 7. 
wherein Y is alkoxy. 

9. Use of compounds according to any one of claims 1 to 
8 in nucleic acid synthesis reactions, especially for 

30 preparing oligo- or polynucleotides. 

10. Use of compounds according to any one of claims 1 to 
8 in minisequencing . 

35 11. A method for determining the sequence of a nucleic 
acid, comprising providing a single- stranded template 
comprising the nucleic acid to be determined, and at least 
partially synthesizing a complementary nucleic acid 
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molecule in a stepwise serial manner by the addition of 
nucleotides in which the identity of each nucleotide 
incorporated into the complementary nucleic acid molecule 
is determined subsequent to its incorporation, wherein 
said nucleotides are compounds of Formula I as defined in 
any one of claims 1 to 8, and wherein the 3 ' -blocking 
group is removed from the nucleotide after its 
incorporation to permit further extension of the nucleic 
acid molecule. 

12. The method according to claim 11, wherein the 3'- 
blocking group is removed by acid hydrolysis. 

13. The method according to claim 11 or 12, wherein F in 
Formula I is a fluorescent label. 

14. The method according to any one of claims 11 to 13, 
wherein said single-stranded template is bound to a solid 
support . 

15. A process of preparing a compound of Formula la: 



ar a salt thereof, wherein B, X, z, R lf R 2 , Q, F, m and n 
are as defined in claim 1, and p and q independently are 
integers from 1 to 10, by reacting a compound of Formula 




p. N ^CH 2 ) q -(Q) ffi -<F) n 



II: 
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hoIo-pJ-o-i/n 11 

OH OH OH \ / 
OH 



or a salt thereof, wherein B and X are as defined above, 
with a compound of Formula III: 



10 

R 2 -OCH- (CH 2 )p-l-COOR 3 hi 



ZRi 



wherein R i( R 2 , Z and p are as defined above, and R 3 is 
15 hydrocarbyl, to produce a compound of Formula IV: 



O 0 x 



HO -P-O-P-O-P-O-wO B 
OH OH OH \ 7 

20 B V Rl ^ 

COOR3 

or a salt thereof, wherein B, X. Z, Ri, R 2 , R3 and P are 
25 as defined above, optionally reacting the latter with a 
diamine H 2 N- (CH 2 ) q -NH 2 , wherein q is as defined above, to 
produce a compound of Formula V: 



30 % 9 » . . g 

HO-P-O-P-O-P-O— 0 " 



OH OH OH 



R2' C N 

(CH2)p 



35 0 >^-<CH 2 ) q -NH 2 
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wherein B, X, Z, R 1# R 2 , p and q are as defined above, and 
optionally coupling a dye label F to the terminal amino 
group . 
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